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Altering Dough Viscoelasticity with Modified Glut nina 

INTRODUCTION 

Field of the Invention 

The invention relates to changing mechanical properties of 
wheat dough used to make products like breads and noodles by using 
modified glutenin proteins. 

Background 

Wheat cultivation began somewhere in the middle east and has 
been ongoing for perhaps as long as 10,000 years. Today, wheat 
rice and maize form the foundation of the human diet in every 
corner of the world. Of these, wheat is the most widely grown, 
and because of its high protein content, wheat is the single 
greatest source of protein in human diet. 

Wheat is also unique because the mixture of flour and water 
produces a dough with unusual physical properties such as 
viscoelasticity. The rheological properties of dough can be 
attributed to many flour components, but the clearest association 
is with the storage proteins of the grain and flour. These 
proteins are historically divided into several families, of which 
the high molecular weight (HMW) glutenins, correlates best with 
dough viscoelasticity and bread-making quality. The deduced amino 
acid sequences of HMW glutenins reveal a dominant central 
repetitive domain composed of 45-90 repeats of two or three simple 
peptide motifs and non- repetitive terminal domains containing one 
or more cysteine residues. 

It is believed that fundamental to dough functionality is 
the integrity of a highly cross -linked matrix formed by 
intermolecular disulfide linkages. For example, disruption of 
this network, such as by partial reduction, leads to loss of dough 
functional properties. Hence, the number and placement of 
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cysteines sites in the HMW glutenin termini have been suggested 
to be determinants of viscoelastic properties of dough. 

Comparisons of the HMW glutenins correlated with good and 
poor quality have also pointed to non -cysteine residues as being 
important viscoelastic determinants. For example, a glutenin 
known as subunit 10 is associated with good viscoelasticity and 
bread making quality, while subunit 12 is associated with poor 
viscoelasticity and bread making quality. These proteins are 
nearly identical with just twelve single amino acid substitutions, 
two hexamer repeat deletions (in subunit 10) , one 2-mer addition 
and one 2-mer deletion. Because of the two hexamer repeat 
deletions in subunit 10, it is slightly smaller in molecular 
weight than 12. Goldsbrough et al . (1989) Biochem J. 263, 837-842 
have reported that the underlying conformational differences 
between these two similar proteins are due to a short region 
containing only six amino acid difference, none cysteines. 

Relevent Literature 

Anderson (1994) in Improvement of Cereal Quality by Genetic 
Engineering, Eds. Henry and RolandB, Plenum Press, NY, provides 
general background on wheat genetic engineering. Goldsbrough et 
al. (1989) Biochem J. 263, 837-842 report that conformational 
differences between two HMW subunits are due to a short region 
containing six amino acid differences. Bekes et al. (1994) J 
Cereal Science 19, 3-7 describe the use of a 2g Mixograph for 
measuring the effects of a HMW glutenin subunit on dough mixing 
properties. Bekes et al. (1996), in Improvement of Cereal Quality 
by Genetic Engineering, Eds. Henry and Rolands, Plenum Press, NY, 
describe the contributions to mixing properties of HMW glutenins 
expressed in bacterial systems. Shani et al. (1992) Plant 
Physiol. 98, 433-441 report on the role of amino and carboxyl- 
terminal regions in the folding and oligomerization of HMW 



- 2 - 



* 

WO 98/08607 



PCT/US97/15398 



glutenin subunits. Blechl et al. (1996) Nature Biotechnology 14, 
875-883, report the expression of a novel hybrid high molecular 
weight glutenin subunit gene in transgenic wheat. 

SUMMARY OF THE INVENTION 

The invention provides methods and compositions for 
producing dough, preferably wheat dough, with particular 
viscoelastic properties and the use of such dough in products such 
as breads and noodles. Viscoelastic variation is effected by 
using flour milled from seed containing non-natural HMW glutenin 
subunits. In particular, HMW glutenin subunits comprising a non- 
natural repetitive domain having repeats which are non-natural in 
number and/or synthetic in sequence are used to control 
viscoelastic properties of dough. It is demonstrated that while 
controlling for termini and repeat sequence, modifying repeat 
number may be used to change dough properties such as 
viscoelasticity. Hence, the repetitive domain is preferably of 
a non-natural glutenin subunit length, more preferably greater 
than a natural length. 

The invention also provides recombinant genes encoding the 
subject HMW glutenin subunits, plants and seeds comprising such 
genes, and methods of making such plants by genetic engineering. 

The genes may encode natural repeat sequences recombined to 
encode proteins having preferably non-naturally long, repetitive 
domains. The genes may also contain non-synthetic or heterologous 
sequences which may be selected for preferential expression, 
secretion, etc. in a given host. The invention further provides 
doughs and other wheat products with modified viscoelastic 
properties and methods of making such product using the disclosed 
compositions . 
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BRIEF DESCRIPTION OF THE FIGURE 

Figure 1 shows the relationship between repetitive domain length 
and dough mixing time. 

DETAILED DESCRIPTION OF THE INVENTION 

The invention uses novel, modified glutenin subunits to 
change one or more viscoelastic properties of dough or to produce 
dough or dough antecedents such as flour or grain having such a 
modified glutenin subunit. The modified subunits may be 
"introduced" into the dough by adding the modified subunit 
directly to the dough. Preferably, such directly added subunit 
is "incorporated" into the higher order glutenin structure by 
reduction/oxidation reactions (see, e.g. Bekes et al. (1994) J 
Cereal Sci 19, 3-7) . Changing a viscoelastic property of a dough 
may also be effected by modifying dough starting material, such 
as flour or seed, to contain the modified subunit. For example, 
the modified subunits may be "introduced" into the dough by 
genetically transforming a plant, preferably a wheat plant, with 
a functional transgene encoding the modified subunit. The change 
in viscoelastic property is generally in relation to a 
corresponding dough made with a natural glutenin subunit or 
glutenin subunits which differ in terms of repeat copy number. 

As used herein, a HMW glutenin subunit comprises an N- 
terminal, preferably non- repetitive, domain including a cysteine 
residue capable of forming an intermolecular disulfide bond with 
another HMW glutenin subunit, a LMW glutenin or a gliadin, an 
internal repetitive domain comprising a plurality of HMW glutenin 
subunit repeats, and a C-terminal domain, preferably non- 
repetitive, also including a cysteine residue capable of forming 
an intermolecular disulfide bond. HMW glutenins subunit repeats 
comprise the hexamer P/SGQGQQ (SEQ ID NO:l), preferably PGQGQQ 
(SEQ ID NO: 2), the nonamer G Y / HYPT / AS P / LQQ (SEQ ID N0:3), 
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preferably GY/HYPTSPQQ (SEQ ID NO:4), more preferably GYYPTSPQQ 
(SEQ ID:NO:5) and the hexamer plus trimer P/SGQGQQ (SEQ ID NO:l) 
+ GQQ/ preferably PGQGQQ (SEQ ID NO; 2) + GQQ. In a particular 
embodiment, a hexamer and nonamer repeat are joined to form 15-mer 
repeats . The hexamer and nonamer repeats may be combined in a 
variety of orders and numbers and capped with N- and C- termini to 
form HMW glutenin subunits of virtually any predetermined 
sequence. Preferred modified glutenins have from 10 to 150 
repeats, preferably 15-mer repeats. The repetitive domain may be 
constructed from recombined native sequence or from synthetic 
sequence, which may be engineered for optimal expression is a 
chosen host, the incorporation of preferred restrictions sites, 
etc . 

Selection of repeats, repetitive regions, and termini are 
generally made according to desired properties of the resultant 
dough. For example, holding termini and repeat structure 
constant, an unexpected linear relationship between repetitive 
domain length and dough mixing time is demonstrated, as shown in 
Figure 1. Data points 1-8 are, respectively, the fused termini 
with no intervening repetitive domain, a 16-repeat synthetic 
glutenin as described in the experimental section below, Dx5-R441 
(below), a 32-repeat synthetic glutenin, Dx5-R576, the native Dx5, 
a 48-repeat synthetic glutenin, and Dx5-R853. Hence, 
predetermined viscoelastic properties are provided by selecting 
a repetitive domain length. 

A wide variety of other sequences and motifs may be 
introduced into the repetitive domain to effect desired 
conformational and/or functional properties. For example, elastic 
properties may be modified by including polymeric sequences from 
structural proteins comprising repetitive motifs such as elastin, 
collagen, insect silk, etc. 

Similarly, the termini may be engineered to provide any of 
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a variety of properties to effect desired conformational and/or 
functional properties, to aid in expression, secretion or 
purification (see, e.g. McPherson et al. (1992) Biotechnol. Prog. 
8, 347-352), etc. For example, the number and position of 
cysteine residues available for intermolecular disulfide linkage 
may be varied, physical, e.g His6, or epitope tags may be 
provided, etc. 

The following experimental section is offered by way of 
illustration and not by way of limitation. 

EXAMPLES / EXPERIMENTAL 

Example 1 

a) Structure of the repetitive domain modified Dx5 genes 

Our 1Dx5 construct, pET-3a-Dx5 f contained the entire 1Dx5 
coding region except of the signal peptide. The coding region of 
this construct was 2484 bp including the initial codon ATG and 
produced a polypeptide consisting of 828 amino acids, 696 of which 
represented the repetitive domain. 

The pET-3a-DX5-853 (R853) construct contained the DrdI.EspI 
blunt ended fragment inserted into the DrdI blunt ended site of 
the pET-3a-Dx5. The ExpI/DrdI fragment was 471 bp long and coded 
for 157 amino acids , The addition of this segment generated an 
in frame codon, GCA, coding for an alanine residue. The HMW 
glutenin subunit produced by R853 was 19% larger than that 
expressed by the normal pET-3a-Dx5, and contained a repetitive 
domain that was 22.5% larger than that present in the Dx5 subunit. 

The pET-3a-Dx5-576 (R576) construct was generated by 
removing the Hindlll/DrdI fragment from the repetitive region of 
the !Dx5 gene. The removal of this fragment and subsequent blunt 
end reaction generate an in frame CAA codon which codes for a 
glutamine residue. The protein produced by the 576 was 14.5% 
shorter than the naturally-expressed subunit and contained a 
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repetitive domain that was 17.2% shorter than that present in the 
Dx5 subunit. 

The pET-3a-Dx5-441 (R441) construct was made by removing the 
Expl/Ncol fragment from the repetitive domain of the 1Dx5 gene. 
The removal of this fragment and subsequent blunt end reaction 
generated an in frame GCG codon, which codes for an alanine 
residue. The HMW glutenin subunit produced by R441 was 3 0.7% 
shorter than the naturally-expressed subunit and contained a 
repeating domain that was 36.6% shorter than that present in the 
Dx5 subunit . 

b) Escherichia coli expression and analysis 

Bacteria harboring pET-3a-Dx5 or the modified constructs 
were induced by adding IPTG in the growth medium. An aliquot of 
the culture was taken 16 h after induction and analysis by SDS- 
PAGE showed the presence of an additional band in all four 
constructs compared with the induced cells harboring the control 
pET-3a. The additional bands expressed in R853, R576 and R441.had 
apparent molecular weights of about 142 kDa, 104 kDa and 82 kDa. 

The identity of these bands was confirmed by Western blot 
analysis. 

By restricting the variations to the repetitive domain 
repeat number, we were able to directly investigate the role of 
repeat number on subunit conformation and function. For example, 
comparison between actual (calculated) and apparent (gel mobility) 
of all plasmid constructs showed that the characteristic slower 
migration of these proteins increased (i.e. was further retarded) 
as a linear function of the repeat length. In fact, the linear 
relationship allowed us to calculate the apparent added MW per 
amino acid residue; specifically, each amino acid in the internal 
repeat accounts for an increase in MW of about 45 compared with 
actual MW. Note that this finding was only made possible by the 



- 7 - 



WO 98/08607 



PCT/US97/15398 



mutational analysis disclosed herein and can not be inferred from 
analysis of natural glutenin alleles which do not control for non- 
repetitive regions. For example, Dx2 migrates slower than even 
though its molecular size is smaller. DylO and Dyl2 also exhibit 
such an inverse correlation between size and migration 
retardation. 

In contrast to the retardation patterns, the deletions and 
insertions in the repetitive domain did not result in any 
significant modification of the isoelectric point (pi) value for 
the various mutants. These results are consistent with the nature 
of the repetitive domain, which is low in charged amino acid 
residues. 

The mutants were also analyzed by RP-HPLC analysis revealed 
that the Dx5 subunit expressed in E.coli had the same retention 
time as the subunit purified from flour. The elution times of the 
modified subunits showed an inverse correlation with their size. 

R853, the largest construct eluted at 30,6 min, R576 eluted at 
32.4 min, and R441 eluted at 33.4 min. Again, this finding is 
contrary to the inference that would be derived from elution data 
for natural HMW glutenin subunits which do not elute in order of 
decreasing size. (See, Margiotta et al . (1993) J Cereal Science 
17, 221-236 and Marchylo et al. (1992) Cereal Chemistry 69, 371- 
378) . 

Example 2 

a) Structure of the synthetic HMW-glutenin gene 

The most important structural features of the HMW-glutenins, 
both from theoretical structure/ function considerations and for 
the construction of genes de novo, are the terminal, 
cysteine -containing domains, and the central repetitive domain. 

The central domain is composed of 45-90 copies of short peptide 
motifs which are believed to form a regular higher order 



- 8 - 



WO 98/08607 



PCT/US97/15398 



structure, although the exact form of the structure is not known. 
To understand the molecular basis of HMW-glutenin functioning a 
strategy was developed for assembling synthetic HMW-glutenin genes 
with four features; The first feature is the separate 
construction of fused terminal domains into which can be inserted 
the separately constructed repetitive domains. This simplifies 
the construction by allowing assembly of the two disparate 
sections by different strategies, and allows "mixing-and-matching" 
of terminal regions and repetitive domains from the eventual sets 
of available constructions. 

The construct containing the fused terminal domains was 
modeled on the Dx5 HMW-glutenin gene (Glu -Dl-ld; Anderson et al., 
1989, Nucleic Acids Res. 17, 461-462) which is highly correlated 
with wheat quality parameters and has been theorized to play a 
central role in dough visco-elasticity (Greene et al., 1989, Proc. 
Int. Wheat Gene. Swamp. 7th Cambridge 1, 735-740.). The DNA and 
amino sequences of the synthetic terminal construct are shown in 
SEQ ID NO: 6 and 7, respectively* The terminal construct does not 
encode the signal peptide cleaved off during protein processing 
in the wheat endosperm. 

The second feature of the synthetic HMW-glutenin construct 
is the placement of restriction sites useful in modifying cysteine 
residue number and position. This strategy was used by Ferret ti 
et al. (1986) Proc. Natl. Acad. Sci. USA 83 f 599-603, in the 
construction of a synthetic rhodopsin: 72 oligos, 15-40 bp long, 
included 28 unique restriction fragments about 60 bp apart for 
systematic structure/function studies. Similarly, Martin et al. 
(1995) Gene 154, 159-166, placed restriction sites at 270-300 bp 
intervals in their tropoelastin construct. The synthetic 
HMW-glutenin terminal sequence has 12 unique restriction sites 
(five additional sites occur in the vector but could be used, if 
necessary, by partial digest strategies) . For example, the first 
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13 amino acids, including the first cysteine residue, could be 
modified by replacement of the Ndel - Nrul DNA fragment. 

A site for a repetitive domain insertion is provided by the 
bases 340 to 357 at the Xbal and Spel sites. When the correct 
orientation of the repeat domain is inserted into the terminal 
domain construct the reading frame is intact throughout the three 
protein domains. 

The fourth feature in constructing a synthetic HMW-glutenin 
gene was to use codons correlated with highly expressed E. coli 
genes (Sharp et al.,1988, Nucleic Acids Res. 16, 8207*8211). At 
three sites a less than optimal (moderate frequency) codon was 
used to create the Nrul, MscI and Eco47III sites. At four sites 
base changes were used to create restriction sites and resulted 
in neutral amino acid changes: two Asp to Glu, one Glu to Asp, and 
one Thr to Ser exchange. Other exceptions were used in the 
repetitive domain where two less frequently used proline and 
serine codons were substituted to reduce oligo internal annealing 
and to create restriction sites. 

(b) Construction and expression of the terminal domains 

A DNA fragment encoding the two fused terminal domains was 
synthesized using PCR to assemble the final sequence from a set 
of oligos: 14 oligos of 40-68 bases with complementary ends, and 
a 15th oligo of 21 bases to serve as the baclcward primer. The 522 
bp BamHI +BstXI cut PCR product was cloned into pBluescript KS+ 
and a correct sequence clone was generated. This insert was then 
transferred to pET3a and expressed, confirming the presence of a 
functional translation context and reading frame. The induced 
terminal polypeptide (157 aa) typically comprises 10-25% of the 
total bacterial protein, and bands on SDS-PAGE at an apparent 
molecular weight of 27 kDa. . 
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(c) Construction of repetitive domain 

The first repetitive domain choaen to be constructed was 
based on the pentadecapeptide motif: PGQGQQGYYPTS PQQ (SEQ ID 
NO: 8) . Translation of the wheat motif sequence into the E.coli 
optimal expression codons showed no restriction sites. However, 
restriction ends could be generated by using the sequence: 
S PQQPGQGQQG YYPT (SEQ ID NO: 9) and substituting the moderately used 
Ser AGC codon for TCT. When two of these repeats are fused in the 
correct orientation, complete motifs are reformed, with a partial 
repeat at each end of the domain. The monomer (encoding the 15 
amino acid repeat motif) was then removed by a Nhel + Spel double 
digest, isolated by PAGE, and ligated at high insert concentration 
to the vector pUC13-NSpUC19 cut with the same two enzymes. Since 
all four ends in the ligation are compatible, all possible 
ligation orientations should occur. Multiple insertion clones cut 
with Nhel + Spel would restrict only where Nhel - Nhel or Spel - 
Spel ligations occurred. The head-to-tail ligations would destroy 
both sites and only DNA fragments with tandemly arrayed repeats 
would be released from the vector. This strategy was used to 
construct correct dimer inserts that were used in a second round 
to construct inserts with four, six, and eight excisable copies 
of the monomer in the same orientation. Sequencing confirmed the 
correct sequence of these multi-copy insertions. 

To build higher numbers of repeats a second procedure 
utilized the Nhel and Spel sites at the ends of the repeat 
polymer. This assembly strategy allows building repeat domains 
of any size possible from the available clones; i.e., 23 » 1 + 6 
+ 16, 48 e 16 + 32, etc. 

(d) Assembly and expression of complete synthetic gene 

The complete HMW-glutenin construct was assembled and 
expressed. We typically obtain HMW-glutenin comprising 10-20% of 
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total bacterial protein, or 15-3 0 mg per liter of culture. The 
synthesized protein has the same unusual extraction properties 
characteristic of the HMW-glutenins (solubility in dilute 
alcohols) , and its identity has been further confirmed by 
N-terminal sequencing. The completed construct expresses a 
protein which migrates under SDS-PAGE at approximately 101 kDa (as 
calculated from standard protein markers) , considerably higher 
than the MW of 71 kDa derived from the DNA sequence - such 
anomalous migrations characteristic of the HMW-glutenins . 

The foregoing experiments describe the procedures used to 
successfully assemble a synthetic HMW-glutenin gene. The new gene 
contains convenient restriction sites for future modification of 
the terminal domains {particularly the number and placement of 
cysteine residues) and the set of repeat polymer clones allows 
constructing 15 -AA motif repeat domains of any size, limited only 
by the ability of E.coli strains to maintain the repeated DNA. The 
construct supported synthesis of a novel HMW-glutenin protein at 
levels at least comparable to that of cloned wheat HMW-glutenin 
genes expressed in E.coli. 

All publications and patent applications cited in this 
specification are herein incorporated by reference as if each 
individual publication or patent application were specifically and 
individually indicated to be incorporated by reference. Although 
the foregoing invention has been described in some detail by way 
of illustration and example for purposes of clarity of 
understanding, it will be readily apparent to those of ordinary 
skill in the art in light of the teachings of this invention that 
certain changes and modifications may be made thereto without 
departing from the spirit or scope of the appended claims. 
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MODIFIED GLUTENINS 

(iii) NUMBER OF SEQUENCES: 9 



(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: USDA, AGRICULTURAL RESEARCH SERVICE, 

PACIFIC WEST 

(B) STREET: 800 BUCHANAN STREET 

(C) CITY: ALBANY 

(D) STATE: CALIFORNIA 

(E) COUNTRY: USA 

(F) ZIP: 94710 



(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 
(BJ COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC -DOS /MS -DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 
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(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (813) 289-2966 

(B) TELEFAX: (813) 289-2967 

(2) INFORMATION FOR SEQ ID NO:l: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 
(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 44.. 1855 
(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 34.. 1845 
(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

Xaa Gly Gin Gly Gin Gin 
1 5 
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(2) INFORMATION FOR SEQ ID NO; 2; 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: G amino acids 

(B) TYPE; amino acid 

(CJ STRANDEDNESS: not relevant 
(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Pro Gly Gin Gly Gin Gin 
1 5 



(2) INFORMATION FOR SEQ ID NO; 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY; not relevant 
(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY; Modif ied-si te 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /note- "First Xaa is Tyr or His- 
second Xaa is Thr or Ala; third Xaa is Pro or Leu" 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:3; 

Gly Xaa Tyr Pro Xaa Ser Xaa Gin Gin 
1 5 
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(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 
(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /note- "Xaa is Tyr 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Gly Xaa Tyr Pro Thr Ser Pro Gin Gin 
1 5 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 



Gly Tyr Tyr Pro Thr Ser Pro Gin Gin 
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12) -INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 477 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
<ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

ATGGAAGGTG AAGCGTCTGA ACAGCTGCAG TGCGATCGCG AACTGCAGGA ACTGCAGGAA 60 

CGTGAACTGA AAGCTTGCCA GCAGGTTATG GACCAGCAGC TGCGTGACAT CTCTCCGGAA 120 

TGCCACCCGG TTGTTGTTTC TCCGGTTGCT GGCCAGTACG AACAGCAGAT CGTTGTTCCG 180 

CCGAAAGGTG GTACCTTCTA TCCGGGTGAA ACCACTCCGC CGCAGCAGCT GCAGCAGCGT 240 

ATCTTCTGGG GCATCCCGGC TCTGCTGAAG CGCTACTACC CGTCTGTTAC TTGTCCGCAG 300 

CAGGTTTCTT ACTACCCGGG TCAGGCTTCT CCGCAGCGTT CTAGTTCTTC TTACCACGTT 360 

TCTGTTGAAC ACCAGGCTGC ATCTCTGAAA GTTGCTAAAG CTCAGCAGCT GGCTGCGCAG 420 

CTGCCGGCTA TGTGCCGCCT GGAAGGTGGT GACGCTCTGT CTGCTTCTCA GTGATAG 4 77 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 157 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 



Met Glu Gly Glu Ala Ser Glu Gin Leu Gin Cys Asp Arg Glu Leu Gin 

5 !° IS 

Glu Leu Gin Glu Arg Glu Leu Lys Ala Cys Gin Gin Val Met Asp Gin 

25 30 
Gin Leu Arg Asp lie Ser Pro Glu Cys His Pro Val Val Val Ser Pro 

35 4 ° 45 

Val Ala Gly Gin Tyr Gl u G ln Gln „. Val ^ prQ ^ ^ 

55 60 
Thr Phe Tyr Pro Gly Glu Thr Thr Pro Pro Gin Gin Leu Gin Gin Arg 

He Phe Trp Gly lie Pro Ala Leu Leu Lys Arg Tyr Tyr Pro Ser Val 



80 

e Pro Ala Leu Leu Lys Ara Tvr Tvr d*-~ e_ 

85 

Thr cys Pro Gin Gin Val Ser T yr Tyr Pro Gly Gin Ala 



90 95 



ioo y Gln Ala Ser Pro G1 n 

105 110 

Arg ser Ser Ser Ser Tyr His Val Ser Val Glu His Gln Ala Ala Ser 

120 125 

Leu Lys Val Ala Lys Ala Gln Gln Leu Ala Ala Gln Leu Pro Ala Met 

135 140 
Cys Arg Leu Glu Gly Gly Asp Ala Leu Ser Ala Ser Gln 



150 155 



!) INFORMATION FOR SEQ ID NO: 8: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : not relevant 

(D) TOPOLOGY: not relevant 
(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Pro Gly Gln Gly Gln Gln Gly Tyr Tyr Pro Thr Ser Pro Gln Gln 



10 15 
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(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 
(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 



Ser Pro Gin Gin Pro Gly Gin Gly Gin Gin Gly Tyr Tyr Pro Thr 
5 10 15 
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WHAT -IS CLAIMED IS: 

1. A method for changing a viscoelastic property of a dough, 
said method comprising introducing into said dough a HMW glutenin 
subunit comprising a non-natural repetitive domain, said domain 
comprising a plurality of HMW glutenin subunit repeats, wherein 
said repeats are synthetic in sequence or non-natural in number. 

2. A method for making a flour comprising a modified HMW 
glutenin subunit, said method comprising milling a seed comprising 
a HMW glutenin subunit comprising a non-natural repetitive domain, 
said domain comprising a plurality of HMW glutenin subunit 
repeats, wherein said repeats are synthetic in sequence or non- 
natural in number. 

3 . A HMW glutenin subunit comprising a non-natural repetitive 
domain, said domain comprising a plurality of HMW glutenin subunit 
repeats, wherein said repeats are synthetic in sequence or non- 
natural in number. 

4. A HMW glutenin subunit according to claim 3, wherein said 
non-natural repetitive domain comprises a non-natural repeat 
number. 

5. A HMW glutenin subunit according to claim 3, wherein said 
non-natural repetitive domain comprises a synthetic repeat 
sequence . 

6. A HMW glutenin subunit according to claim 3, wherein said 
non-natural repetitive domain comprises a heterologous repeat 
sequence • 
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7. A HMW glutenin subunit according to claim 3, wherein said 
non-natural repetitive domain is of a non-natural length. 

8. A wheat, flour, wheat dough, bread or noodle comprising a HMW 
glutenin subunit according to claim 3 . 

9 . A HMW glutenin subunit gene encoding a HMW glutenin subunit 
according to claim 3 . 

10. A plant or seed comprising a gene encoding a HMW glutenin 
subunit according to claim 3. 

11 . A method for making a seed comprising a HMW glutenin subunit 
according to claim 3, said method comprising growing a plant 
comprising a HMW glutenin subunit gene encoding a HMW glutenin 
subunit according to claim 1 under conditions whereby said gene 
is expressed in a seed of said plant, and harvesting said seed. 

12. A method for making a plant comprising a HMW glutenin subunit 
gene encoding a HMW glutenin subunit according to claim 3, said 
method comprising introducing said gene into said plant or a 
progenitor of said plant. 
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